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We propose a network characterization of combinatorial fitness landscapes by adapting the notion of inherent 
networks proposed for energy surfaces. We use the well-known family of NK landscapes as an example. In 
our case the inherent network is the graph whose vertices represent the local maxima in the landscape, and the 
edges account for the transition probabilities between their corresponding basins of attraction. We exhaustively 
extracted such networks on representative NK landscape instances, and performed a statistical characterization 
of their properties. We found that most of these network properties are related to the search difficulty on the 
underlying NK landscapes with varying values of K. 
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I. INTRODUCTION 

Difficult combinatorial landscapes are found in many important problems in physics, computing, and in common everyday life 
activities such as resource allocation and scheduling. For example, spin-glass systems give rise to such energy landscapes which 

x- 

are characterized by many local minima and high energy barriers between them. These landscapes generally show frustration, 
i.e. frozen disorder where the system is unable to relax into a state in which all constraints are satisfied. In completely different 
fields, such as combinatorial optimization, similar hard problems also arise, for example the well-known traveling salesman 
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problem and many others. 

In order to understand the reasons that make these problems difficult to optimize, a number of model landscapes have been 
proposed. One of the simplest yet representative example is Kauffman's family of NK landsdcapes [1]. The NK family of 
landscapes is a problem-independent model for constructing multimodal landscapes that can gradually be tuned from smooth to 
rugged, where the term "rugged" is intuitively related to the degree of variability in the objective function value in neighboring 
positions in configuration space. The more rugged the landscape, the higher the number of local optima, and the landscape 
becomes correspondingly more difficult to search for the global optimum. The idea of an NK landscape is to have N "spins" or 
"genes", each with two possible values, or 1. The model is a real stochastic function $ defined on binary strings s e {0, 1}^ 
of length N, $ : s — » R + . The value of K determines how many other spin values in the string influence a given spin 
Si, i — 1, . . . , N. The value of $ is the average of the contributions fa of all the spins: 



1 N 



i=l 

By increasing the value of K from to N — 1, NK landscapes can be tuned from smooth to rugged. For K = all 
contributions can be optimized independently which makes $ a simple additive function with a single maximum. At the other 
extreme when K = N — 1 the landscape becomes completely random, the probability of any given configuration of being the 
optimum is 1/ (JV + 1), and the expected number of local optima is 2 / (N + 1). Intermediate values of K interpolate between 
these two cases and have a variable degree of "epistasis", i.e. of spin (or gene) interaction QJJ] . 

The K variables that form the context of the fitness contribution of gene Sj can be chosen according to different models. The 
two most widely studied models are the random neighborhood model, where the K variables are chosen randomly according to 
a uniform distribution among the N — 1 variables other than Sj, and the adjacent neighborhood model, in which the K variables 
are those closest to Sj in a total ordering s±, S2, ■ ■ ■ , sjv (using periodic boundaries). No significant differences between the two 
models were found in terms of global properties of the respective families of landscapes, such as mean number of local optima 
or autocorrelation length JjJ, [2|] . Similarly, our preliminary studies on the characteristics of the NK landscape optima networks 
did not show noticeable differences between the two neighborhood models. Therefore, we conducted our full study on the more 
general random model. 

The NK model is related to spin glasses, and more precisely to p-spin models |3|,|4|], where p plays a role similar to K. In 
spin glasses the function analogous to $ is the energy H and the stable states are the minima of the energy hypersurface. 

In this study we seek to provide fundamental new insights into the structural organization of the local optima in combinatorial 
landscapes, particularly into the connectivity of their basins of attraction. Combinatorial landscapes can be seen as a graph whose 



vertices are the possible configurations. If two configurations can be transformed into each other by a suitable operator move, 
then we can trace an edge between them. The resulting graph, with an indication of the fitness at each vertex, is a representation 
of the given problem fitness landscape. A useful a simplification of the graphs for the energy landscapes of atomic clusters 
was introduced in ylla]. The idea consists in taking as vertices of the graph not all the possible configurations, but only those 
that correspond to energy minima. For atomic clusters these are well-known, at least for relatively small assemblages. Two 
minima are considered connected, and thus an edge is traced between them, if the energy barrier separating them is sufficiently 
low. In this case there is a transition state, meaning that the system can jump from one minimum to the other by thermal 
fluctuations going through a saddle point in the energy hyper-surface. The values of these activation energies are mostly known 
experimentally or can be determined by simulation. In this way, a network can be built which is called the "inherent structure" 
or "inherent network" in Q5l, lZ[| - 

We propose a network characterization of combinatorial fitness landscapes by adapting the notion of inherent networks de- 
scribed above. We use the family of NK landscapes as an example. In our case the inherent network is the graph where the 
vertices are all the local maxima and the edges account for transition probabilities between maxima. We exhaustively extract 
such networks on representative small NK landscape instances, and perform a statistical characterisation of their properties. 



Our analysis is inspired, in particular, by the work of [5 
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on energy landscapes, and in general, by the field of complex net- 



works [8]. A related work can be found in H, where the case of lattice polymer chains is studied. However, the notion of an 
edge there is very different, being related to moves that bring a given conformation into an allowed neighboring one. Similar 
ideas have been put forward in physical chemistry to understand the thermodynamics and kinetics of complex biomolecules 
through the network study of their free-energy landscapes [10]. It should also be noted that our approach is different from the 



barrier-tree representations of landscapes proposed by Stadler et al. (see, for example, 111 110 . 

The next section describes how combinatorial landscapes are mapped onto networks, and includes the relevant definitions and 
algorithms used in our study. The empirical analysis of our selected NK landscape instances is presented in the following two 
sections; one devoted to the study of basins, and the other to the network statistical features. Finally, we present our conclusions 
and ideas for future work. 



II. LANDSCAPES AS NETWORKS 



Many natural and artificial systems can be modeled as networks. Typical examples are communication systems (the Internet, 
the web, telephonic networks), transportation lines (railway, airline routes), biological systems (gene and protein interactions), 
and social interactions (scientific co-authorship, friendships). It has been shown in recent years that many of these networks 



exhibit was has been called a small-world topology ||8t], in which nodes are highly clustered yet the path length between them 
is small. Additionally, in several of these networks the distribution of the number of neighbours (the degree distribution) is 
typically right-skewed with a "heavy tail", meaning that most of the nodes have less-than-average degree whilst a small fractions 
of hubs have a large number of connections. These topological features are very relevant since they impact strongly on networks' 
properties such as their robustness and searchability. 

To model a physical energy landscape as a network, Doye [5] needed to decide first on a definition both of a state of the system 
and how two states were connected. The states and their connections will then provide the nodes and edges of the network. For 
systems with continuous degrees of freedom, this was achieved through the 'inherent structure' mapping. In this mapping each 
point in configuration space is associated with the minimum (or 'inherent structure') reached by following a steepest-descent 
path from that point. This mapping divides configurations into basins of attraction surrounding each minimum on the energy 
landscape. 

Our goal is to adapt this idea to the context of combinatorial optimization. In our case, the vertexes of the graph can be straight- 
forwardly defined as the local maxima of the landscape. These maxima are obtained exhaustively by running a best-improvement 
local search algorithm (see Fig. Q]) from every configuration of the search space. The definition of the edges, however, is a much 



more delicate matter. In our initial attempt IU2I1 we considered that two maxima i and j were connected (with an undirected and 
unweighed edge), if there exists at least one pair of direct neighbors solutions Sj and Sj, one in each basin of attraction (bi and 
bj). We found empirically on small instances of NK landscapes, that such definition produced densely connected graphs, with 
very low (< 2) average path length between nodes for all K. Therefore, apart from the already known increase in the number of 
optima with increasing K, no other network property accounted for the increase in search difficulty. Furthermore, a single pair 
of neighbours between adjacent basins, may not realistically account for actual basin transitions occurring when using common 
heuristic search algorithms. These considerations, motivated us to search for alternative definitions of the edges connecting local 
optima. In particular, we decided to associate weights to the edges that account for the transition probabilities between the nodes 
(local optima). More details on the relevant algorithms and formal definitions are given below. 

A. Definitions and Algorithms 



Definition: Fitness landscape II ill . 
A landscape is a triplet (S, V, f) where S is a set of potential solutions i.e. a search space, V : S — > 2 s , a neighborhood 
structure, is a function that assigns to every s G S a set of neighbors V(s), and / : S — > R is a fitness function that can be 
pictured as the height of the corresponding solutions. 
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In our study, the search space is composed by binary strings of length N, therefore its size is 2 . The neighborhood is defined 
by the minimum possible move on a binary search space, that is, the 1-move or bit-flip operation. In consequence, for any given 
string s of length N, the neighborhood size is \ V(s)\ = N. 

The HillClimbing algorithm to determine the local optima and therefore define the basins of attraction, is given below: 

Choose initial solution s € S 



Definition: Local optimum. 
A local optimum is a solution s* such that Vs e V(s*), f(s) < f(s*). 

The HillClimbing algorithm defines a mapping from the search space S to the set of locally optimal solutions S*. 
Definition: Basin of attraction. 

The basin of attraction of a local optimum i 6 Sis the set bi = {s e S | HillClimbing(s) = i}. The size of the basin of 
attraction of a local optimum i is the cardinality of bi. 

Notice that for non-neutral fitness landscapes, as are NK landscapes, the basins of attraction as defined above, produce a 
partition of the configuration space S. Therefore, S = U ie s*^i an d Vi e £ Vj ^ i, bi n bj = 

Definition: Edge weight. 

For each pair of solutions s and s , let us define p(s — > s ) as the probability to pass from s to s with the given neighborhood 
structure. In the case of binary strings of size N, and the neighborhood defined by the single bit-flip operation, there are N 
neighbors for each solution, therefore: 

if s € V(s) , p(s -> s') = jj and 
if s V(s) ,p(s -+s') = 0. 

We can now define the probability to pass from a solution seStoa solution belonging to the basin bj, as: 



repeat 

choose s G V(s) such that f(s ) = max xeV (s) f( x ) 
if/(s)</(s')then 



s «— s 
end if 

until s is a Local optimum 



FIG. 1 : HillClimbing algorithm. 




s'ebj 



Notice thatp(s -> bj) < 1. 
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Thus, the total probability of going from basin bi to basin bj is the average over all s £ bi of the transition probabilities to 
solutions s £ bj : 

p(bi -> 6j) = -jj- ^p( s 

jjbi is the size of the basin 6j. We are now prepared to define our 'inherent' network or network of local optima. 

Definition: Local optima network. 
The local optima network G = (S*,E) is the graph where the nodes are the local optima, and there is an edge £ E with 
weight Wij = p(bi — > bj) between two nodes i and j if p(bi — > bj) > 0. 

Notice that since each maximum has its associated basin, G also describes the interconnection of basins. 

According to our definition of edge weights, Wij = p(bi — > 6j) may be different than Wji = p(bj — > 6j). Thus, two weights 
are needed in general, and we have an oriented transition graph. 

The following two definitions are relevant to the discussion of the boundary of basins. 

Definition: Boundary of a basin of attraction. 
The boundary B(b) of a basin of attraction b can be defined as the set of configurations within a basin that have at least one 
neighbor's solution in another basin b . 

Definition: Interior of a basin of attraction. 
Conversely, the interior 1(b) of a basin is composed by the configurations that have all their neighbors in the same basin. 
Formally, 

B(b) = {s£b | 3 &V b, 3 s £ b, 3 e ss > £ E}, 

1(b) =6-5(6) 

in. EMPIRICAL ANALYSIS OF BASINS 

In order to avoid sampling problems that could bias the results, we used the largest values of N that can still be analyzed 
exhaustively with reasonable computational resources. We thus extracted the local optima networks of landscape instances with 
N = 14, 16, 18, and K = 2, 4, 6, N — 2, N — 1. For each pair of N and K values, 30 randomly generated instances were 
explored. Therefore, the networks statistics reported below represent the average behaviour of 30 independent instances. 
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Besides the maxima network, it is useful to describe the associated basins of attraction as these play a key role in search 
algorithms. Furthermore, some characteristics of the basins can be related to the optima network features. The notion of the 
basin of attraction of a local maximum has been presented before. We have exhaustively computed the size and number of all 
the basins of attraction for N = 16 and N = 18 and for all even K values plus K = N — 1. In this section, we analyze the 
basins of attraction from several points of view as described below. 



A. Global optimum basin size versus K 

In Fig.|2]we plot the average size of the basin corresponding to the global maximum for N = 16 and N = 18, and all values 
of K studied. The trend is clear: the basin shrinks very quickly with increasing K. This confirms that the higher the K value, 
the more difficult for an stochastic search algorithm to locate the basin of attraction of the global optimum 
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FIG. 2: Average of the relative size of the basin corresponding to the global maximum for each K over 30 landscapes. 



B. Number of basins of a given size 

Fig. [3] shows the cumulative distribution of the number of basins of a given size (with regression line) for a representative 
instances with N = 18, K = 4 . Table U shows the average (of 30 independent landscapes) correlation coefficients and 
linear regression coefficients (intercept (a) and slope (/3)) between the number of nodes and the basin sizes for instances with 
N = 16, 18 and all for all the studied values of K. Notice that distribution decays exponentially or faster for the lower K and 
it is closer to exponential for the higher K. This observation is relevant to theoretical studies that estimate the size of attraction 



basins (see for example [13]). These studies often assume that the basin sizes are uniformly distributed, which is not the case 
for the NK landscapes studied here. From the slopes /3 of the regression lines (table one can see that high values of K give 
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FIG. 3: Cumulative distribution of the number of basins of a given size with regression line. A representative landscape with TV = 18, and 
K — 4 is visualized. A lin-log scale is used. 

rise to steeper distributions (higher f3 values). This indicates that there are less basins of large size for large values of K. In 
consequence, basins are broader for low values of K, which is consistent with the fact that those landscapes are smoother. 

C. Fitness of local optima versus their basin sizes 

The scatter-plots in Fig. |4] illustrate the correlation between the basin sizes of local maxima (in logarithmic scale) and their 
fitness values. Two representative instances for N = 18 and K = 4, 8 are shown. Notice that there is a clear positive correlation 
between the fitness values of maxima and their basins' sizes. In other words, the higher the peak the wider tend to be its basin 
of attraction. Therefore, on average, with a stochastic local search algorithm, the global optimum would be easier to find than 
any other local optimum. This may seem surprising. But, we have to keep in mind that as the number of local optima increases 
(with increasing K), the global optimum basin is more difficult to reach by a stochastic local search algorithm (see Fig. [2j. 
This observation offers a mental picture of NK landscapes: we can consider the landscape as composed of a large number of 
mountains (each corresponding to a basin of attraction), and those mountains are wider the taller the hilltops. Moreover, the size 
of a mountain basin grows exponentially with its hight. 

IV. GENERAL NETWORK STATISTICS 




We now briefly describe the statistical measures used for our analysis of maxima networks. 

The standard clustering coefficient [8] does not consider weighted edges. We thus use the weighted clustering measure 
proposed by 11411 . which combines the topological information with the weight distribution of the network: 
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TABLE I: Correlation coefficient (p), and linear regression coefficients (intercept (a) and slope (J3)) of the relationship between the basin size 
of optima and the cumulative number of nodes of a given (basin) size ( in logarithmic scale: log(p(s)) = a + /3s + e). The average and 
standard deviation values over 30 instances, are shown. 
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where = J2 3 ^i a «™ = 1 if w„ m > 0, a nm = if w nm = and fc, = Sj^j a y • 

For each triple formed in the neighborhood of the vertex i, c w (i) counts the weight of the two participating edges of the vertex 
i. C w is defined as the weighted clustering coefficient averaged over all vertices of the network. 

The standard topological characterization of networks is obtained by the analysis of the probability distribution p(k) that a 
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FIG. 4: Correlation between the fitness of local optima and their corresponding basin sizes, for two representative instances with N = 18, 
K = 4 (top) and K — 8 (bottom). 



randomly chosen vertex has degree k. For our weighted networks, a characterization of weights is obtained by the connectivity 
and weight distributions pi n (w) and p ou t (w) that any given edge has incoming or outgoing weight w. 

In our study, for each node i, the sum of outgoing edge weights is equal to 1 as they represent transition probabilities. So, an 
important measure is the weight wa of self-connecting edges (remaining in the same node). We have the relation: wu + Si = 1. 
The vertex strength, Sj, is defined as = YljeV(i)-U} Wi i 



where the sum is over the set V(i) — {i} of neighbors of i 11411 . 
The strength of a node is a generalization of the node's connectivity giving information about the number and importance of the 
edges. 



Another network measure we report here is disparity 1 1 411 Yz(i), which measures how heterogeneous are the contributions of 
the edges of node i to the total weight (strength): 
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The disparity could be averaged over the node with the same degree k. If all weights are nearby of Sj/fc, the disparity for 
nodes of degree k is nearby 1/k. 

Finally, in order to compute the average shortest path between two nodes on the optima network of a given landscape, we 
considered the expected number of bit- flip mutations to pass from one basin to the other. This expected number can be computed 
by considering the inverse of the transition probabilities between basins. In other words, if we attach to the edges the inverse of 
the transition probabilities, this value would represent the average number of random mutations to pass from one basin to the 
other. More formally, the distance (expected number of bit-flip mutations) between two nodes is defined by dij — 1 /wij where 
Wij = p(bi — > bj). Now, we can define the length of a path between two nodes as being the sum of these distances along the 
edges that connect the respective basins. 

TABLE II: NK landscapes network properties. Values are averages over 30 random instances, standard deviations are shown as subscripts. 
n v and n e represent the number of vertexes and edges (rounded to the next integer), C w , the mean weighted clustering coefficient. Y represent 
the mean disparity coefficient, d the mean path length (see text for definitions). 
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A. Detailed Study of Network Features 

In this section we study in more depth some network features which can be related to stochastic local search difficulty on the 
underlying fitness landscapes. Table HI1 reports the average (over 30 independent instances for each N and K) of the network 
properties described. n v and n e are, respectively, the mean number of vertices and the mean number of edges of the graph for a 
given K rounded to the next integer. C w is the mean weighted clustering coefficient. Y is the mean disparity, and d is the mean 
path length. 

1. Clustering Coefficients 

The fourth column of table [IT] lists the average values of the weighted clustering coefficients for all N and K. It is apparent 
that the clustering coefficients decrease regularly with increasing K for all N. For the standard unweighed clustering, this would 
mean that the larger K is, the less likely that two maxima which are connected to a third one are themselves connected. Taking 
weights, i.e. transition probabilities into account this means that either there are less transitions between neighboring basins for 
high K, and/or the transitions are less likely to occur. This confirms from a network point of view the common knowledge that 
search difficulty increases with K. 

2. Shortest Path to the Global Optimum 

The average shortest path lengths d are listed in the sixth column of table [IT] Fig. [5] (top) is a graphical illustration of the 
average shortest path length between optima for all the studied NK landscapes. Notice that the shortest path increases with N, 
this is to be expected since the number of optima increases exponentially with N. More interestingly, for a given N the shortest 
path increases with K, up to K = 10, and then it stagnates and even decreases slightly for the N — 18. This is consistent 
with the well known fact that the search difficulty in NK landscapes increases with K. However, some paths are more relevant 
from the point of view of a stochastic local search algorithm following a trajectory over the maxima network. In order to better 
illustrate the relationship of this network property with the search difficulty by heuristic local search algorithms, Fig. (bottom) 
shows the shortest path length to the global optimum from all the other optima in the landscape. The trend is clear, the path 
lengths to the optimum increase steadily with increasing K. 
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FIG. 5: Average distance (shortest path) between nodes (top), and average path length to the optimum from all the other basins (bottom). 



3. Outgoing Weight Distribution 



Here we report on the outgoing weight distributions p ou t(w) of the maxima network edges. Fig. [6] shows the empirical 
probability distribution function for the cases N = 16 and N = 18 (logarithmic scale has been used on the x-axis). The case 
N = 14 is similar but is not reported here because it is much more noisy for K = 2 and 4 due to the small size of the graphs 
in these cases (see table [II]). One can see that the weights, i.e. the transition probabilities to neighboring basins are small. The 
distributions are far from uniform and, for both N = 16 and N = 18, the low K have longer tails. For high K the decay is 
faster. This seems to indicate that, on average, the transition probabilities are higher for low K. 

We have already remarked that the approach taken in O, la, l2D is different in that edges between two optima either exist or not; 
in other words the notion of transition probability is absent. However, it is worth recalling that Doye et al. |5|,|g] found that their 
inherent networks were of the scale-free type with the global minimum being often the most connected node. The landscape is 
thus essentially a "funnel" meaning that the system can relax to the global minimum from almost any other local minimum. In 
our language, we would say that this kind of landscape is an 'easy" one to search. In contrast, the inherent networks found by 
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FIG. 6: Probability distribution of the network weights Wij for outgoing edges with j ^ i in log-scale on x-axis. Averages of 30 instances for 
each N and K are reported. 
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for lattice polymer chains are of the small-world type but they show a fast-decaying degree distribution function. 



4. Disparity 

Fig. |7]depicts the disparity coefficient as defined in the previous section for N — 16, 18. An interesting observation is that 
the disparity (i.e. inhomogeneity) in the weights of a node's out-coming links tends to decrease steadily with increasing K. 
This reflects that for high K the transitions to other basins tend to become equally likely, which is another indication that the 
landscape, and thus its representative maxima network, becomes more random and difficult to search. 

When K increases, the number of edges increases and the number of edges with a weight over a certain threshold increases 
too (see fig. |6]l. Therefore, for small K, each node is connected with a small number of nodes each with a relative high weight. 
On the other hand, for large K, the weights become more homogeneous in the neighbourhood, that is, for each node, all the 
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neighboring basins are at similar distance. 

If we make the hypothesis that edges with higher weights are likely to be connected to nodes with larger basins (an intuition 
that we need to confirm in future work) then, as the larger basins tend to have higher fitness (see Fig.|4]i, the path to higher fitness 
values would be easier to find for lower K than for larger K. 
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FIG. 7: Average disparity, Y2, of nodes with a given degree k. Average of 30 independent instances for each TV and K are reported. The curve 
1/k is also reported to compare to random case. 



J. Boundary of basins 

Fig. [8]shows the averages, over all the nodes in the network, of the weights wu (i.e. the probabilities of remaining in the same 
basin after a bit-flip mutation). Notice that the weights wu are much higher when compared to those mj,j with j ^ i (see Fig. [6]). 
In particular, for K = 2, 50% of the random bit-flip mutations will produce a solution within the same basin of attraction. These 
average probabilities of remaining within the same basin, are above 12% for the higher values of K. Notice that the averages 
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are nearly the same regardless the value of N, but decrease with the epistatic parameter K. 

The exploration of new basins with the random bit-flip mutation seems to be, therefore, easier for large K than for low K. 
But, as the number of basins increases, and the fitness correlation between neighboring solutions decreases with increasing 
K, it becomes harder to find the global maxima for large K. This result suggests that the dynamic of stochastic local search 
algorithms on NK landscapes with large K is different from that with lower values of K, with the former engaging in more 
random exploration of basins. 




2 4 6 8 10 12 14 16 18 



K 

FIG. 8: Average weight u„ according to the parameters TV and K. 

Table [III] gives the average number of configurations in the interior of basins (this statistic is computed on 30 independent 
landscapes). Notice that the size of the basins' interior is below 1% (except for N — 14, K — 2). Surprisingly, the size of the 
basins' boundaries is nearly the same as the size of the basins themselves. Therefore, the probability of having a neighboring 
solution in the same basin is high, but nearly all the solutions have a neighbor solution in another basin. Thus, the interior basins 
seem to be "hollow", a picture which is far from the smooth standard representation of landscapes in 2D with real variables 
where the basins of attraction are visualized as real mountains. 

6. Incoming Weights Distribution 

It is also of interest to study the distribution of the weights of edges impinging into a given node p, n (w). However, a plot of this 
quantity is not very informative. We prefer to show in Fig.|9]the average values over 30 independent lansdcapes for each value 
of N and for the whole interval of K. The general trend for all values of N is that the average weight of the incoming transitions 
into a node quickly decreases with increasing K. This means that it is more difficult to make a transition to a local maximum 
when K is large. This agrees with the fact that the relative basins' size is a rapidly decreasing function of K (see Fig. |2j. In fact, 
there is a strong positive correlation between the basins' size and the weights of the transitions into the corresponding maximum, 
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TABLE III: Average (on 30 independent landscapes for each N and K) of the mean relative sizes of the basins interiors. 
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FIG. 9: Average value of the weights of incoming transitions into maxima nodes for N = 14, 16, 18 and for whole K interval. 

i.e. as the basin becomes larger, the number of transitions into it increases too. This is shown on the scatter-plots (Fig. [Tot . 
The correlation follows approximately a power-law, the regression lines are also visualized. The correlation coefficient for all 
plots is high (above 0.97). If we hypothesize that the incoming weights are proportional to the size of the basin, i.e. that edges 
between nodes are randomly distributed over the search space, the sum of the incoming weights for a basin bi should be 
This theoretical line is visualized on the scatter-plots (Fig.fTOb. Notice that the difference between the theoretical and regression 
lines, is higher for low values of K. For large K, the weights are given almost only by the size of basins. This is not the case for 
small K values, where the fitness correlation between neighboring solutions is high |2|]. This explains why the hypothesis does 



18 



not hold in this case. So, the incoming weights could be deduced from the size of basins and the fitness of solutions belonging 
to the basin. 
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FIG. 10: Scatter-plots of the sum of weights into basins vs. the basin size for N = 18 and various K values. Log-log scale. The regression 
line of each scatter-plot is shown. The theoretical curves, under the hypothesis that the weights are proportional, to the size of basin are also 
given. 



V. CONCLUSIONS 

We have proposed a new characterization of combinatorial fitness landscapes using the well-known family of NK landscapes 
as an example. We have used an extension of the concept of inherent networks proposed for energy surfaces [5] in order to 
abstract and simplify the landscape description. In our case the inherent network is the graph where the nodes are all the 
local maxima and the edges accounts for transition probabilities (using the 1 -flip operator) between the local maxima basins 
of attraction. This mapping leads to oriented weighted graphs, instead of the more commonly used unordered and unweighed 
ones [5]. We believe that the present representation is closer to the view offered by a Monte Carlo search heuristic such as 
simulated annealing which produces a trajectory on the configuration space based on transition probabilities according to a 



Boltzmann equilibrium distribution! 15]. We have exhaustively obtained these graphs for N = {14, 16, 18}, and for all even 
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values of K, plus K = N — 1, and conducted a network analysis on them. The network representation of the NK fitness 
landscapes has proved useful in characterizing the topological features of the landscapes and gives important information on the 
structure of their basins of attraction. In fact, our guiding motivation has been to relate the statistical properties of these networks, 
to the search difficulty of the underlying combinatorial landscapes when using stochastic local search algorithms (based on the 
bit-flip operator) to optimize them. We have found clear indications of such relationships, in particular: 

The clustering coefficients: suggest that, for high values of K, the transition between a given pair of neighboring basins is less 
likely to occur. 

The shortest paths to the global optimum: become longer with increasing N, and for a given iV, they clearly increase with 
higher K. 

The outgoing weight distribution: indicate that, on average, the transition probabilities from a given node to neighbor nodes 
are higher for low K. 

The incoming weight distribution: indicate that, on average, the transition probabilities from the neighborhood of a node 
become lower with increasing K. 

The disparity coefficients: reflect that for high K the transitions to other basins tend to become equally likely, which is an 
indication of the randomness of the landscape. 

The previous results clearly confirm and justify from a novel network point of view the empirically known fact that NK 
landscapes become harder to search as they become more and more random with increasing K. 

The construction of the maxima networks requires the determination of the basins of attraction of the corresponding land- 
scapes. We have thus also described the nature of the basins, and found that the size of the basin corresponding to the global 
maximum becomes smaller with increasing K. The distribution of the basin sizes is approximately exponential for all N and K, 
but the basin sizes are larger for low K, another indirect indication of the increasing randomness and difficulty of the landscapes 
when K becomes large. Furthermore, there is a strong positive correlation between the basin size and the degree of the corre- 
sponding maximum, which confirms that the synthetic view provided by the maxima graph is a useful. Finally, we found that 
the size of the basins boundaries is roughly the same as the size of basins themselves. Therefore, nearly all the configurations in 
a given basin have a neighbor solution in another basin. This observation suggests a different landscape picture than the smooth 
standard representation of 2D landscapes where the basins of attraction are visualized as real mountains. Some of these results 
on basins in NK landscapes were previously unknown yj]. 
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This study represents our first attempt towards a topological and statistical characterization of easy and hard combinatorial 
landscapes, from the point of view of complex networks analysis. Much remains to be done. First of all, the results found should 
be confirmed for larger instances of NK landscapes. This will require good sampling techniques, or theoretical studies since 
exhaustive sampling becomes quickly impractical. Other landscape types should also be examined, such as those containing 
neutrality, which are very common in real-world applications, and especially the landscapes generated by important hard combi- 
natorial problems such as the traveling salesman problem and other resource allocation problems. Work is in progress for neutral 
versions of NK landscapes and for knapsack problems. Finally, the landscape statistical characterization is only a step towards 
implementing good methods for searching it. We thus hope that our results will help in designing or estimating efficient search 
techniques and operators. 
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